Community-based Link Prediction with Text

Authors

  • David Mimno
  • Hanna Wallach
  • Andrew McCallum
Abstract

There has been much recent interest in generative models for graphs. The intuition behind the study of such link prediction functions is that they provide a succinct description of the process by which networks grow and evolve: a model that accurately predicts small-scale actions such as coauthorships should help us understand the global properties of the network. Previous work in social network analysis, such as Liben-Nowell and Kleinberg [5], has often focused on generative models that take into account only the graph structure of the network, without making any use of the individual properties of the nodes themselves. Frequently, however, much richer data is available than the link structure alone, such as text documents for coauthorship networks. In this paper, we propose a generative model for documents that produces both text and authors based on a notion of communities, each of which has a distribution over authors and a distribution over topics. We demonstrate this model on the proceedings of the NIPS conference, showing improved likelihood of held-out coauthorship data. Discovering latent structure can also be useful in analyzing long-term trends, such as the growth and fragmentation of communities.
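The abstract does not spell out the generative process, so the following is only a minimal sketch of one way a community-based model of this kind could be read. It assumes a uniform prior over communities, a single community per document, and toy distributions; the function names (`draw`, `sample_document`, `coauthor_score`) and all data are hypothetical and are not taken from the paper.

```python
import random


def draw(dist, rng):
    """Draw one key from a dict that maps outcomes to probabilities."""
    keys = list(dist)
    return rng.choices(keys, weights=[dist[k] for k in keys], k=1)[0]


def sample_document(communities, topics, n_authors=2, n_words=8, rng=None):
    """Sample authors and words for one document (illustrative assumptions only)."""
    rng = rng or random.Random()
    # Assumption: each document belongs to one community, chosen uniformly.
    name = rng.choice(sorted(communities))
    community = communities[name]
    # Draw distinct authors from the community's author distribution.
    candidates = [a for a, p in community["authors"].items() if p > 0]
    target = min(n_authors, len(candidates))
    authors = set()
    while len(authors) < target:
        authors.add(draw(community["authors"], rng))
    # For each word, draw a topic from the community, then a word from that topic.
    words = []
    for _ in range(n_words):
        topic = draw(community["topics"], rng)
        words.append(draw(topics[topic], rng))
    return {"community": name, "authors": sorted(authors), "words": words}


def coauthor_score(a, b, communities):
    """Score a candidate link: sum over communities of p(c) * p(a|c) * p(b|c)."""
    prior = 1.0 / len(communities)  # assumed uniform community prior
    return sum(
        prior * c["authors"].get(a, 0.0) * c["authors"].get(b, 0.0)
        for c in communities.values()
    )


# Toy data, invented for illustration.
communities = {
    "vision": {"authors": {"A": 0.5, "B": 0.5}, "topics": {"images": 1.0}},
    "theory": {"authors": {"C": 0.6, "D": 0.4}, "topics": {"bounds": 1.0}},
}
topics = {
    "images": {"pixel": 0.5, "object": 0.5},
    "bounds": {"theorem": 0.5, "regret": 0.5},
}

doc = sample_document(communities, topics, rng=random.Random(0))
same_community = coauthor_score("A", "B", communities)
cross_community = coauthor_score("A", "C", communities)
```

In this toy setting, authors who share a community receive a higher coauthorship score than authors who do not, which is the intuition behind using community structure for link prediction.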

Similar resources

Link Prediction using Network Embedding based on Global Similarity

Background: Link prediction is one of the most widely studied problems in complex network analysis. It requires knowledge of previous link connections, combined with other available information. Local link prediction approaches based on node structure are fast but not accurate enough. On the other hand, the global link predicti...

Community Detection and Link Prediction for Visual Genome Image Object-to-Object Relationships

This project explores various network representations of the Visual Genome annotated image dataset. The relational structure of the items within the dataset’s images lends itself well to a network representation consisting of three node types—entities, predicates, and attributes—and we perform community detection and relationship / link prediction on this tripartite graph. Relationship predicti...

A Link Prediction Method Based on Learning Automata in Social Networks

Nowadays, online social networks are considered one of the most important emerging phenomena of human societies. In these networks, link prediction relies on existing knowledge of the interactions between network actors to estimate the probability that a new relationship will form in the future. A wide range of applications can be found for link prediction such as electro...

Community Detection Based on Link Prediction Methods

Community detection and link prediction are both of great significance in network analysis, and each provides valuable insights into the topological structure of the network from a different perspective. In this paper, we propose a novel community detection algorithm that incorporates link prediction, motivated by the question of whether link prediction can be used to improve the accuracy of com...

A Document Weighted Approach for Gender and Age Prediction Based on Term Weight Measure

Author profiling is a text classification technique used to predict the profile of an unknown text's author by analyzing writing style. Author profiles are characteristics of the authors such as gender, age, native language, country, and educational background. Existing approaches to author profiling suffer from problems like high dimensionality of features and fail to capture th...

Publication date: 2007